Semi-supervised Semantic Role Labeling
نویسنده
چکیده
Most corpus-based approaches to language learning have focused on tasks for which a sufficient amount of human-labeled training data is available. However, it is difficult to produce such data, and models trained from such data tend to be brittle when applied to domains that vary, even in seemingly minor ways, from the training data. We claim that these difficulties can by overcome by applying semi-supervised learning techniques. Semi-supervised techniques learn from both labeled and “raw” data. In our case, the latter is raw text. Several researchers have used semi-supervised techniques for language learning (Nigam et al. 2000; Blum & Mitchell 1998; Joachims 1999; Riloff & Jones 1999), but we believe that this area is not yet well explored and definitely not well understood. Therefore, we present a challenge problem for semi-supervised learning: semantic role labeling and semantic relationship annotation. Semantic role labeling was introduced by Gildea & Jurafsky (2002), and we added semantic relationship annotation in Thompson, Levy, & Manning (2003). This problem is a difficult one for semisupervised techniques, for three reasons. First, there are many possible classes (the role labels) for examples. Second, sequence learning is involved. Third, the learning scenario is plagued by sparse data problems. We describe the role labeling problem, our learning model and its extendibility to semi-supervised learning, and some preliminary experiments. Semantic Role Labeling Extracting semantic meaning from the surface form of sentences is important for many understanding tasks performed by human and machine alike. These include inference, antecedent resolution, word sense disambiguation, or even syntactic tasks such as prepositional phrase attachment. We focus here on a particular type of semantic analysis, that of determining the semantic roles and relationships at play in a sentence. For example, in the sentence ∗Some of this work was supported by an ARDA Aquaint grant while the author was a Visiting Assistant Professor at Stanford University. Carolin Arnold at the University of Utah was crucial in running the EM experiments. Copyright c © 2004, American Association for Artificial Intelligence (www.aaai.org). All rights reserved. “The hammer broke the vase,” we may want to find the participants, or roles, in the breaking action. In this case the roles are instrument, filled by “the hammer,” and patient, filled by “the vase.” We may also care about the over-arching relationship created by these roles and the breaking action, which by the theory we are using is the CAUSE TO FRAGMENT relationship. Having semantic roles allows one to recognize the semantic arguments of a situation, even when expressed in different syntactic configurations. For example, in “I broke the vase with the hammer,” the vase and hammer play the same roles as in the previous sentence. We have developed a generative model for performing this type of semantic analysis. Generative models are probability models representing a joint distribution over a set of variables. The specific probability settings are called parameters. As pointed out by Jurafsky (2003) and others, probabilistic models capture cognitively plausible aspects of human language processing, generation, and learning. Our model takes as input the constituents of a sentence and a predicator word (or phrase) from that sentence. In the previous example broke is the predicator. The predicator takes semantic role arguments, instantiated by the constituents. We learn the parameters for this model from a body of examples provided by the FrameNet corpus (Baker, Fillmore, & Lowe 1998). The problem and some elements of our approach are similar to that of Gildea & Jurafsky (2002), but our work differs by use of a generative, not a discriminative (conditional), model. We also add the inference of the overarching relationship between the roles, called the frame.
منابع مشابه
Semi-Supervised Semantic Role Labeling
Large scale annotated corpora are prerequisite to developing high-performance semantic role labeling systems. Unfortunately, such corpora are expensive to produce, limited in size, and may not be representative. Our work aims to reduce the annotation effort involved in creating resources for semantic role labeling via semi-supervised learning. Our algorithm augments a small number of manually l...
متن کاملSemi-Supervised Semantic Role Labeling: Approaching from an Unsupervised Perspective
Reducing the reliance of semantic role labeling (SRL) methods on human-annotated data has become an active area of research. However, the prior work has largely focused on either (1) looking into ways to improve supervised SRL systems by producing surrogate annotated data and reducing sparsity of lexical features or (2) considering completely unsupervised semantic role induction settings. In th...
متن کاملSemi-supervised Semantic Role Labeling via Graph-Alignment
Semantic roles, which constitute a shallow form of meaning representation, have attracted increasing interest in recent years. Various applications have been shown to benefit from this level of semantic analysis, and a large number of publications has addressed the problem of semantic role labeling, i.e., the task of automatically identifying semantic roles in arbitrary sentences. A major limit...
متن کاملSemi-Supervised Semantic Role Labeling via Structural Alignment
Large-scale annotated corpora are a prerequisite to developing high-performance semantic role labeling systems. Unfortunately, such corpora are expensive to produce, limited in size, and may not be representative. Our work aims to reduce the annotation effort involved in creating resources for semantic role labeling via semi-supervised learning. The key idea of our approach is to find novel ins...
متن کاملSemantic Role Labeling of Chinese Using Transductive SVM and Semantic Heuristics
Semantic Role Labeling (SRL) as a Shallow Semantic Parsing causes more and more attention recently. The shortage of manually tagged data is one of main obstacles to supervised learning, which is even serious in SRL. Transductive SVM (TSVM) is a novel semi-supervised learning method special to small mount of tagged data. In this paper, we introduce an application of TSVM in Chinese SRL. To impro...
متن کاملSemi-supervised Semantic Role Labeling Using the Latent Words Language Model
Semantic Role Labeling (SRL) has proved to be a valuable tool for performing automatic analysis of natural language texts. Currently however, most systems rely on a large training set, which is manually annotated, an effort that needs to be repeated whenever different languages or a different set of semantic roles is used in a certain application. A possible solution for this problem is semi-su...
متن کامل